Scalable, Efficient Range Queries for Grid Information Services
نویسندگان
چکیده
Recent Peer-to-Peer (P2P) systems such as Tapestry, Chord or CAN act primarily as a Distributed Hash Table (DHT). A DHT is a data structure for distributed storing of pairs (key, data) which allows fast locating of data when a key is given. To facilitate efficient queries on a range of keys, we propose a CAN-based extension of this DHT-functionality. The design of our extension suggests several range query strategies; their efficiency is investigated in the paper. A further goal is to enhance the routing aspects of current DHT-systems so that also frequently changing data can be handled efficiently. We show that some relatively simple approaches are able to reduce the communication overhead in this case. The design of the system is driven by its application as a part of the information infrastructure for computational grids. Such grids provide an infrastructure for sharing computing resources; an information infrastructure is their inherent part which collects resource data and provides search functionality. Our approach complements current solutions such as MDS-2 by adding self-organization, faulttolerance and an ability to efficiently handle dynamic attributes, such as server processing capacity. We evaluate our system in this context via a simulation and show that its design along with particular query and update strategies meet the goals of scalability, communication-efficiency and availability.
منابع مشابه
NodeWiz: Fault-tolerant grid information service
Large scale grid computing systems may provide multitudinous services, from different providers, whose quality of service will vary. Moreover, services are deployed and undeployed in the grid with no central coordination. Thus, to find out the most suitable service to fulfill their needs, or to find the most suitable set of resources on which to deploy their services, grid users must resort to ...
متن کاملEfficient Evaluation of Continuous Range Queries on Moving Objects
Abstract. In this paper we evaluate several in-memory algorithms for efficient and scalable processing of continuous range queries over collections of moving objects. Constant updates to the index are avoided by query indexing. No constraints are imposed on the speed or path of moving objects. We present a detailed analysis of a grid approach which shows the best results for both skewed and uni...
متن کاملRCT: A distributed tree for supporting efficient range and multi-attribute queries in grid computing
Resource discovery is of great importance in grid environments. Most of existing approaches treat all resources equally without any categorizing mechanism. We propose, Resource Category Tree (RCT), which organizes resources based on their characteristics represented by primary attributes (PA). RCT adopts a structure of distributed AVL tree, with each node representing a specific range of PA val...
متن کاملDesign and Specification of a Prototype Service / Resource
This deliverable presents the current state of the design and specifications of the XtreemOS component called “Service/Resource Discovery System (SRDS)”. The SRDS is a key component of the highly available and scalable infrastructure described in the deliverable D3.2.1 (Design of an Infrastructure for Highly Available and Scalable Grid Services) under the responsibility of WP3.2. The SRDS will ...
متن کاملScalable DHT- and ontology-based information service for large-scale grids
Current grid information services are centralized or hierarchical and prove inefficient as the scale of the grid rapidly increases. The introduction of the P2P DHT technique into grids brings an encouraging path. However, current applications of the P2P DHT technique to grids do not consider the Virtual Organization (VO) management mode of grid resources. Frequent joining and leaving of resourc...
متن کامل